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DETAILED ACTION 
Status of Claims: 
Claims 1-7, 9-12, and 14-22 are pending. 
Claims 1-3, 7, 9-10, 12, 14-15, and 17-19 are amended. 

Continued Examination Under 37 CFR 1.114 

1 . A request for continued examination under 37 CFR 1.114, including the fee set 
forth in 37 CFR 1.17(e), was filed in this application after final rejection. Since this 
application is eligible for continued examination under 37 CFR 1.114, and the fee set 
forth in 37 CFR 1 .17(e) has been timely paid, the finality of the previous Office action 
has been withdrawn pursuant to 37 CFR 1.114. Applicant's submission filed on 
07/13/2009 has been entered. 

Claim Rejections - 35 USC § 103 

2. The following is a quotation of 35 U.S.C. 1 03(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 102 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 

3. Claims 1-7, 9-12, and 14-22 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over Goodman (US 5,999,929 A) in view of Winshell (US 2002/0099813 
A1). 
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Claim 1 

Goodman teaches a method performed by a computer system, the method 
comprising: 

retrieving, by a processor associated with the computer system, a first plurality of 
uniform resource locators (URLs), where one or more URLs of the first plurality of URLs 
include a parameter string (Column 5 Lines 1-4, "the spider 14 uses URLs to 
identify Web pages to be retrieved for analysis"); 

selecting , by a processor associated with the computer system, one or more 
parameters present in parameter strings of the first plurality of URLs : selecting, bv a 
processor associated with the computer system, a first URL from the retrieved first 
plurality of URLs, where the first URL includes the selected one or more parameters: 
generating , bv a processor associated with the computer system, a second plurality of 
different URLs having different parameter combinations of the one or more selected 
parameters (Column 6 line 62 - Column 7 line 5, "Since, as described above, a 
single Web page can be identified by different links, it will be desirable to 
determine when multiple links, which can be associated with the same and/or 
multiple classes, in fact identify the same Web page. To accomplish that 
operation, the Web page analyzer 15 converts each link to the canonical form, 
using URL "re-write rules" that it develops and stores in the URL re-write 
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rulebase 16B. The Web page analyzer 15 generates the re-write rules as it 
analyzes Web pages received from the spider 14"); 

retrieving , by a processor associated with the computer system, content using 
the first URL (Column 5 Lines 5- 9 "After the spider 14 receives a Web page for 
analysis, it caches the Web page locally within the link referral system 10 with 
other cached Web pages, retrieved earlier by the spider 14, for later bulk analysis 
by the web page analyzer 15"); 

retrieving , by a processor associated with the computer system, content using 
the second plurality of different URLs; comparing, by a processor associated with the 
computer system, the content retrieved using the first URL to the content retrieved using 
the second plurality of different URLs (Column 7 lines 41-50" In addition, the Web 
page analyzer can process each candidate URL generated by eliminating portions 
from the beginning of the World Wide Web address (such as candidate URL (1)) 
to eliminate portions from the end of the World Wide Web address, and each 
candidate URL generated by eliminating portions from the end of the World Wide 
Web address (such as candidate URLs (2), (3) and (4)) to eliminate portions from 
the beginning of the World Wide Web address"); identifying, based on the 
comparing, one of the parameter combinations, that, when present in a particular URL, 
results in retrieving content that is approximately the same as the content corresponding 
to the first URL, the identifying being performed by a processor associated with the 
computer system ; and generating, by a processor associated with the computer system , 
one or more URL rewrite rules based on the identified one of the parameter 
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combinations (Column 7-8 Lines 24-53, "In generating the URL re-write rules, the 
Web page analyzer 15 generally processes the URL from the outward most 
portions of the respective World Wide Web addresses, eliminating portions of the 
respective series, as defined by the separators, to determine candidate URLS. For 
the illustrative URL above, HTTP://www.netscape.com/ index.html", candidate 
URLs will generally include, for example, eliminating portions from the beginning 
of the World Wide Web address"). 

Goodman does not specifically disclose the parameter string comprising at least 
one parameter and a value associated with the at least one parameter . 

Winshell teaches in Paragraph [0033] "Any implementation will have a method 
for specifying a set of rewriting rules. Each rule will specify the path part of the URL to 
be matched in the original URL and the set of parameter names whose names and 
values are to be rewritten into the path part of the modified URL entry by the rewriting 
software" in order to "to determine the specific query parameters that are to be moved 
(Paragraph [0022]). 

It would have been obvious to one of ordinary skill in the art at the time of the 
invention to create the invention of Goodman reference to include "Any implementation 
will have a method for specifying a set of rewriting rules. Each rule will specify the path 
part of the URL to be matched in the original URL and the set of parameter names 
whose names and values are to be rewritten into the path part of the modified URL 
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entry by the rewriting software" as taught by Winshell in order to "determine the specific 
query parameters that are to be moved (Paragraph [0022]). 

Claim 2 

The modified Goodman teaches the method of claim 1 , where the second 
plurality of different URLs includes the first URL with no parameters, the first URL with 
one parameter of the one or more parameters , and the first URL with two or more of the 
one or more parameters (Column 7 Lines 24-28, "In generating the URL re-write 
rules, the Web page analyzer 15 generally processes the URL from the outward 
most portions of the respective World Wide Web addresses, eliminating portions 
of the respective series, as defined by the separators, to determine candidate 
URLS", and also see Column 7 Lines 28-50). 

Claim 3 

The modified Goodman teaches the method of claim 1 , further comprising: 

performing the selecting a first URL, the generating a second plurality of different 
URLs, the retrieving content using the first URL, the retrieving content using the plurality 
of URLs, the comparing the content, and the identifying one of the parameter 
combinations, for multiple different first URLs of the first plurality of URLs, each first 
URL including the one or more parameters; and 

generating the one or more URL rewrite rules for the identified one of the 
parameter combinations for each of the first URLs (See Claim 1 rejection). 
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Claim 4 

The modified Goodman teaches the method of claim 3, where the rewrite rules 
specify that parameters that do not occur in a threshold number of the identified one of 
the parameter combinations are to be removed (Column 8 Lines 30-33, "After 
generating the score, the Web page analyzer 15 will store the candidate re-write 
rule in the URL re-write rulebase 16B if the score is below a predetermined 
threshold value"). 

Claim 5 

The modified Goodman teaches the method of claim 1, wherein each rewrite rule 
applies to a particular web site or web host (Column 5 Lines 17-21, "To assist in the 
duplicate Web page consolidation operation, the Web page analyzer 15 develops 
the URL re-write rulebase 16B, which contains rules which are used by the Web 
page analyzer 15 to convert URLs to respective canonical forms"). 

Claim 6 

The modified Goodman teaches the method of claim 1, where the identified one 
of the parameter combinations includes a minimum number of parameters with respect 
other ones of the parameter combinations (Column 7 Lines 40-50, examples show 
removing portions from the "beginning" and "end" of the World Wide Web 
address without ever actually removing the first unique part of the URL). 

Claim 7 
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Goodman teaches a method, performed by a computer system, for converting a 
uniform resource locator (URL) into a canonical form of the URL, the method 
comprising: 

receiving a URL that refers to content and that includes a parameter string 
(Column 5 Lines 1-4, "the spider 14 uses URLs to identify Web pages to be 
retrieved for analysis"); 

selecting, by a processor of the computer system, a rewrite rule by receiving a 
plurality of URLs that each include a particular parameter string, where the particular 
parameter string includes a combination of the one or more parameters selected from 
the parameter string included in the received URL, and identifying parameters of the 
one or more parameters that do not result in retrieving substantially different content, 
when present in a URL (Column 7-8 Lines 24-53, "In generating the URL re-write 
rules, the Web page analyzer 15 generally processes the URL from the outward 
most portions of the respective World Wide Web addresses, eliminating portions 
of the respective series, as defined by the separators, to determine candidate 
URLS. For the illustrative URL above, HTTP://www.netscape.com/ index. html", 
candidate URLs will generally include, for example, eliminating portions from the 
beginning of the World Wide Web address"); 

applying, by a processor of the computer system , the rewrite rule to the URL by 
removing the parameters that do not contribute to content from the URL; and outputting 
the rewritten URL as the canonical form of the URL (Column 5 Lines 17-21, "To assist 
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in the duplicate Web page consolidation operation, the Web page analyzer 15 
develops the URL re-write rulebase 16B, which contains rules which are used by 
the Web page analyzer 15 to convert URLs to respective canonical forms"). 

Goodman does not specifically disclose the parameter string including one or 
more parameters and values associated with the one or more parameters . 

Winshell teaches in Paragraph [0033] "Any implementation will have a method 
for specifying a set of rewriting rules. Each rule will specify the path part of the URL to 
be matched in the original URL and the set of parameter names whose names and 
values are to be rewritten into the path part of the modified URL entry by the rewriting 
software" in order to "to determine the specific query parameters that are to be moved 
(Paragraph [0022]). 

It would have been obvious to one of ordinary skill in the art at the time of the 
invention to create the invention of Goodman reference to include "Any implementation 
will have a method for specifying a set of rewriting rules. Each rule will specify the path 
part of the URL to be matched in the original URL and the set of parameter names 
whose names and values are to be rewritten into the path part of the modified URL 
entry by the rewriting software" as taught by Winshell in order to "determine the specific 
query parameters that are to be moved (Paragraph [0022]). 

Claim 9 
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The modified Goodman teaches the method of claim 7, where the identifying 
parameters of the one or more parameters includes; retrieving first content 
corresponding to a first URL including a first combination of parameters; 

retrieving second content corresponding to a second URL including a second 
combination of parameters, where the first combination of parameters includes at least 
one parameter not included in the second combination of parameters; and 

identifying first content as substantially the same as the second content (Column 
8 Lines 1-9, "If the Web page analyzer 15 determines in step 2b that the URLs in 
the entry are not identical to each other, it (that is, the Web page analyzer 15) find 
the shortest substitution rule that textually rewrites the longer URL into the 
shorter URL. For example, the shortest rule to change 

http://www.netscape.com/index.html" to HTTP://netscape.com/index.html" is to 
replace "www." with "" (that is, delete "www."). This rule is now a "candidate" 
rewrite rule"). 

Claim 10 

The modified Goodman teaches the method of claim 9, where the second 
combination of parameters includes no parameters, an individual parameter of the first 
parameter combination, or a combination of two or more parameters of the first 
parameter combination (Column 7 Lines 24-28, "In generating the URL re-write 
rules, the Web page analyzer 15 generally processes the URL from the outward 
most portions of the respective World Wide Web addresses, eliminating portions 
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of the respective series, as defined by the separators, to determine candidate 
URLS", and also see Column 7 Lines 28-50). 

Claim 11 

The modified Goodman teaches the method of claim 7, where the rewrite rule 
applies to a particular web site or web host (Column 5 Lines 17-21, "To assist in the 
duplicate Web page consolidation operation, the Web page analyzer 15 develops 
the URL re-write rulebase 16B, which contains rules which are used by the Web 
page analyzer 15 to convert URLs to respective canonical forms"). 

Claim 12 

Goodman teaches one or more devices comprising: 

at least one fetch bot to download content on a network from 

locations specified by uniform resource locators (URLs) (Column 4 Lines 60-65, 

"spider"); 

a content manager to extract URLs from the downloaded 
content (Column 5 Lines 5-10, "Web page analyzer"); 

a rewrite component to receive a URL that refers to content and that includes a 
parameter string , apply a predetermined rewrite rule to the URL that removes the at 
least one parameter from the URL when the at least one parameter does not affect the 
content referred to by the URL, where the predetermined rewrite rule is determined by 
receiving a plurality of URLs that include parameter strings comprising combinations of 
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parameters, and identifying parameters in the parameter string that do not result in 
retrieving substantially different content, when present in a URL; and output the 
rewritten URL as the canonical form of the URL (Column 5 Lines 17-21, "To assist in 
the duplicate Web page consolidation operation, the Web page analyzer 15 
develops the URL re-write rulebase 16B, which contains rules which are used by 
the Web page analyzer 15 to convert URLs to respective canonical forms"); and a 
URL manager to store the canonical form of the URL (Column 5 Lines 30-33, "The 
Web page analyzer 15 stores information regarding the identifications for the 
various classes and the Web page assignment information in the link class 
database 17"). 

Goodman does not specifically disclose the parameter string including at least 
one parameter and a value associated with the at least one parameter . 

Winshell teaches in Paragraph [0033] "Any implementation will have a method 
for specifying a set of rewriting rules. Each rule will specify the path part of the URL to 
be matched in the original URL and the set of parameter names whose names and 
values are to be rewritten into the path part of the modified URL entry by the rewriting 
software" in order to "to determine the specific query parameters that are to be moved 
(Paragraph [0022]). 

It would have been obvious to one of ordinary skill in the art at the time of the 
invention to create the invention of Goodman reference to include "Any implementation 
will have a method for specifying a set of rewriting rules. Each rule will specify the path 
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part of the URL to be matched in the original URL and the set of parameter names 
whose names and values are to be rewritten into the path part of the modified URL 
entry by the rewriting software" as taught by Winshell in order to "determine the specific 
query parameters that are to be moved (Paragraph [0022]). 

Claim 14 

The modified Goodman teaches the one or more devices of claim 12, where the 
identifying 

parameters in the parameter string includes; retrieving first content corresponding to a 
first URL including a first combination of parameters; 

retrieving second content corresponding to a second URL including a second 
combination of parameters, where the first combination of parameters includes at least 
one parameter not included in the second combination of parameters; and identifying 
the first content as substantially the same as the second content (Column 8 Lines 1-9, 
"If the Web page analyzer 15 determines in step 2b that the URLs in the entry are 
not identical to each other, it (that is, the Web page analyzer 15) find the shortest 
substitution rule that textually rewrites the longer URL into the shorter URL. For 
example, the shortest rule to change http://www.netscape.com/index.html" to 
"HTTP://netscape.com/index.html" is to replace "www." with "" (that is, delete 
"www."). This rule is now a "candidate" rewrite rule"). 

Claim 15 
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The modified Goodman teaches the one or more devices of claim 14, where the 
second combination of parameters includes an individual parameter of the first 
parameter combination, or a combination of two or more parameters of the first 
parameter combination (Column 7 Lines 24-28, "In generating the URL re-write 
rules, the Web page analyzer 15 generally processes the URL from the outward 
most portions of the respective World Wide Web addresses, eliminating portions 
of the respective series, as defined by the separators, to determine candidate 
URLS", and also see Column 7 Lines 28-50). 

Claim 16 

The modified Goodman teaches the one or more devices of claim 12, where 
each rewrite rule applies to a particular web site or web host (Column 5 Lines 17-21, 
"To assist in the duplicate Web page consolidation operation, the Web page 
analyzer 15 develops the URL re-write rulebase 16B, which contains rules which 
are used by the Web page analyzer 15 to convert URLs to respective canonical 
forms"). 

Claim 17 

Goodman teaches a system comprising: 

one or more devices comprising means for receiving a first uniform resource 
locator (URL) including a parameter string, (Column 5 Lines 1-4, "the spider 14 uses 
URLs to identify Web pages to be retrieved for analysis"); 
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means for retrieving content corresponding to the first URL (Column 5 Lines 5- 
"After the spider 14 receives a Web page for analysis, it caches the Web page 
locally within the link referral system"); 

means for retrieving content corresponding to a plurality of URLs having 
different parameter combinations of the one or more parameters , where the one or 
more parameters are selected from the parameter string ; means for identifying the 
parameter combination from the plurality of URLs that corresponds to content that is 
approximately the same as the content corresponding to the first URL and that contains 
a minimum number of parameters compared to other parameter combinations; 
(Column 7-8 Lines 24-53, "In generating the URL re-write rules, the Web page 
analyzer 15 generally processes the URL from the outward most portions of the 
respective World Wide Web addresses, eliminating portions of the respective 
series, as defined by the separators, to determine candidate URLS. For the 
illustrative URL above, HTTP://www.netscape.com/ index. html", candidate URLs 
will generally include, for example, eliminating portions from the beginning of the 
World Wide Web address"); and 

means for generating one or more URL rewrite rules based on the 
identified parameter combination (Column 5 Lines 17-21, "To assist in the duplicate 
Web page consolidation operation, the Web page analyzer 15 develops the URL 
re-write rulebase 16B, which contains rules which are used by the Web page 
analyzer 15 to convert URLs to respective canonical forms"). 
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Goodman does not specifically disclose the parameter string includes one or 
more parameters and values associated with the one or more parameters . 

Winshell teaches in Paragraph [0033] "Any implementation will have a method 
for specifying a set of rewriting rules. Each rule will specify the path part of the URL to 
be matched in the original URL and the set of parameter names whose names and 
values are to be rewritten into the path part of the modified URL entry by the rewriting 
software" in order to "to determine the specific query parameters that are to be moved 
(Paragraph [0022]). 

It would have been obvious to one of ordinary skill in the art at the time of the 
invention to create the invention of Goodman reference to include "Any implementation 
will have a method for specifying a set of rewriting rules. Each rule will specify the path 
part of the URL to be matched in the original URL and the set of parameter names 
whose names and values are to be rewritten into the path part of the modified URL 
entry by the rewriting software" as taught by Winshell in order to "determine the specific 
query parameters that are to be moved (Paragraph [0022]). 

Claim 18 

Goodman teaches a computer-readable memory device including programming 
instructions executed by a processor, the programming instructions comprising: 

instructions for receiving a first uniform resource locator (URL) including_a 
parameter string, (Column 5 Lines 1-4, "the spider 14 uses URLs to identify Web 
pages to be retrieved for analysis"); 



Application/Control Number: 10/748,655 Page 17 

Art Unit: 2446 

instructions for retrieving content corresponding to the first URL (Column 5 
Lines 5- "After the spider 14 receives a Web page for analysis, it caches the Web 
page locally within the link referral system"); 

instructions for retrieving content corresponding to a plurality of URLs 
having different parameter combinations of the one or more parameters , where the one 
or more parameters are selected from the parameter string ; instructions for identifying 
the parameter combination from the plurality of URLs that corresponds to content that is 
approximately the same as the content corresponding to the first URL and that includes 
a minimum number of parameters (Column 7-8 Lines 24-53, "In generating the URL 
re-write rules, the Web page analyzer 15 generally processes the URL from the 
outward most portions of the respective World Wide Web addresses, eliminating 
portions of the respective series, as defined by the separators, to determine 
candidate URLS. For the illustrative URL above, HTTP://www.netscape.com/ 
index.html", candidate URLs will generally include, for example, eliminating 
portions from the beginning of the World Wide Web address"); and 

instructions for generating one or more URL rewrite rules based on the 
identified parameter combination (Column 5 Lines 17-21, "To assist in the duplicate 
Web page consolidation operation, the Web page analyzer 15 develops the URL 
re-write rulebase 16B, which contains rules which are used by the Web page 
analyzer 15 to convert URLs to respective canonical forms"). 
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Goodman does not specifically disclose the parameter string includes one or 
more parameters and values associated with the one or more parameters . 

Winshell teaches in Paragraph [0033] "Any implementation will have a method 
for specifying a set of rewriting rules. Each rule will specify the path part of the URL to 
be matched in the original URL and the set of parameter names whose names and 
values are to be rewritten into the path part of the modified URL entry by the rewriting 
software" in order to "to determine the specific query parameters that are to be moved 
(Paragraph [0022]). 

It would have been obvious to one of ordinary skill in the art at the time of the 
invention to create the invention of Goodman reference to include "Any implementation 
will have a method for specifying a set of rewriting rules. Each rule will specify the path 
part of the URL to be matched in the original URL and the set of parameter names 
whose names and values are to be rewritten into the path part of the modified URL 
entry by the rewriting software" as taught by Winshell in order to "determine the specific 
query parameters that are to be moved (Paragraph [0022]). 

Claim 19 

The modified Goodman teaches the system of claim 17, where the different 
parameter combinations comprise an individual parameter of the one or more 
parameters, or a combination of two or more parameters of the one or more parameters 
(Column 7 Lines 24-28, "In generating the URL re-write rules, the Web page 
analyzer 15 generally processes the URL from the outward most portions of the 
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respective World Wide Web addresses, eliminating portions of the respective 
series, as defined by the separators, to determine candidate URLS", and also see 
Column 7 Lines 28-50). 

Claim 20 

The modified Goodman teaches the computer-readable memory device of claim 
18, where the instructions for receiving a first URL, the instructions for retrieving content 
corresponding to the first URL, the instructions for retrieving content corresponding to a 
plurality of URLs, and the instructions for identifying the parameter combination are 
performed for multiple first URLs, each first URL including the one or more parameters 
(See claim 18 rejection), and where the one or more URL rewrite rules specify that 
parameters that do not occur in a threshold number of the identified parameter 
combinations are to be removed (Column 8 Lines 30-33, "After generating the score, 
the Web page analyzer 15 will store the candidate re-write rule in the URL re-write 
rulebase 16B if the score is below a predetermined threshold value"). 

Claim 21 

The modified Goodman teaches the system of claim 17, further comprising: 
means for determining whether the content that corresponds to the plurality of URLs is 
approximately the same as the content that corresponds to the first URL using a 
similarity hash function (Hash function is a well known function for comparing 
documents. Applicant admits in paragraph [0041] of specification "A document 
having "approximately the same content" as another document may be 
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determined using any of a number of known document comparison techniques, 
such as comparison techniques based on a similarity hash"). 

Claim 22 

The modified Goodman teaches the computer-readable memory device of claim 
18, where the rewrite rules specify that parameters that do not occur in a threshold 
number of the identified parameter combinations are to be removed (Column 8 Lines 
30-33, "After generating the score, the Web page analyzer 15 will store the 
candidate re-write rule in the URL re-write rulebase 16B if the score is below a 
predetermined threshold value"). 

Response to Arguments 

4. Applicant's arguments with respect to claims 1-7, 9-12, and 14-22 have been 
considered but are moot in view of the new ground(s) of rejection. 

Conclusion 

Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to FARHAD ALI whose telephone number is (571)270- 
1920. The examiner can normally be reached on Monday thru Friday, 7:30am to 
5:00pm. 
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If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Jeffrey C. Pwu can be reached on (571) 272-6798. The fax phone number 
for the organization where this application or proceeding is assigned is 571-273-8300. 
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